PATTERN RECOGNITION APPARATUS AND METHOD USING
PROBABILITY DENSITY FUNCTION 

Background of the Invention 
Field of the Invention

The present invention relates to an apparatus for 
recognizing the patterns of characters and the like 
included in information of a recognition target and a 
method thereof. 


Description of the Related Art 

A typical conventional character recognition
method is as follows. First, a feature is extracted from
an input character pattern as a set of numeric values,
that is, a vector, by a specific predetermined method.
In other words, the input pattern is mapped to a point
in a feature vector space by feature extraction. Then,
the distance between the feature vector of this input
pattern and the representative point of each category
in the vector space is calculated, and the closest category
is designated as the recognition result.

The representative point of each category in the
vector space is the average of the sample pattern
feature vectors prepared for that category. For
the distance measure, a city block distance, a Euclidean
distance and the like are used.
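
The nearest-representative-point scheme described above can be sketched in a few lines. This is only a minimal illustration, not the implementation of any particular recognizer: the function names, the two-dimensional toy feature vectors and the category labels "A" and "B" are assumptions made for the example.

```python
import math


def city_block(a, b):
    """City block (L1) distance between two feature vectors."""
    return sum(abs(p - q) for p, q in zip(a, b))


def euclidean(a, b):
    """Euclidean (L2) distance between two feature vectors."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def mean_vector(vectors):
    """Component-wise average of a list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def classify(x, representatives, distance=euclidean):
    """Return the category whose representative point is closest to x."""
    return min(representatives, key=lambda c: distance(x, representatives[c]))


# Hypothetical sample feature vectors for two categories.
samples = {
    "A": [[0.0, 0.0], [2.0, 0.0]],
    "B": [[10.0, 10.0], [12.0, 10.0]],
}
# The representative point of each category is the average of its samples.
representatives = {c: mean_vector(v) for c, v in samples.items()}
```

Either distance can be passed as the `distance` argument; the choice changes only how closeness to the representative point is measured.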

"Hand-written Kanji/Hiragana Recognition by a
Weighted Directional Index Histogram Method" by Tsuruoka
et al. (Paper Journal D of The Institute of Electronics,
Information and Communication Engineers, Vol. J70-D,
No. 7, pp. 1390-1397, July 1987) proposes a method using
a modified Bayes discriminant function that reflects the
distribution of each category in the feature vector space,
instead of a simple distance.

This method is obtained by modifying the Bayes
discriminant function, which is an optimal discriminant
function when the sample patterns follow a normal
distribution and both the average and the covariance matrix
are already known, so as to solve theoretical and
implementation problems. The problems are
that the higher the order of an eigenvector of the
covariance matrix, the larger its estimation error, and that
a huge amount of calculation and a huge memory capacity
are needed. If the (n-dimensional) feature
vector of an input pattern is denoted by x, the Bayes
discriminant function $f_c(x)$ for a category C and the modified
Bayes discriminant function $g_c(x)$ are defined as follows.
(1) Bayes discriminant function

$$f_c(x) = (x - m_c)^t \Sigma_c^{-1} (x - m_c) + \log|\Sigma_c| \tag{1}$$

$m_c$: Average vector of category C
$\Sigma_c$: Covariance matrix of category C
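(2) Modified Bayes discriminant function

$$g_c(x) = \frac{1}{\alpha_c^{k+1}} \left\{ \|x - m_c\|^2 - \sum_{i=1}^{k} \left(1 - \frac{\alpha_c^{k+1}}{\alpha_c^{i}}\right) \left((x - m_c) \cdot v_c^{i}\right)^2 \right\} + \log\left( \prod_{i=1}^{k} \alpha_c^{i} \cdot \prod_{i=k+1}^{n} \alpha_c^{k+1} \right) \tag{2}$$

$\alpha_c^{i}$: i-th eigenvalue of $\Sigma_c$ (the superscript is an index, not an exponent)
$v_c^{i}$: Eigenvector corresponding to the i-th eigenvalue of $\Sigma_c$
k: Integer between 1 and n, including 1 and n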
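
The relation between the two functions can be illustrated with a short sketch. It assumes that the modified function takes the form commonly attributed to the cited paper, in which the eigenvalues beyond the k-th are all replaced by a single constant; that constant is passed here as `tail_eigenvalue`, a hypothetical parameter name playing the role of the (k+1)-th eigenvalue. All function names are illustrative, and the eigenvectors are assumed to be orthonormal.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(p * q for p, q in zip(a, b))


def bayes_discriminant(x, mean, eigenvalues, eigenvectors):
    """Equation (1), evaluated through the full eigendecomposition.

    (x-m)^t S^-1 (x-m) is the sum of squared projections divided by the
    eigenvalues, and log|S| is the sum of the log eigenvalues.
    """
    d = [xi - mi for xi, mi in zip(x, mean)]
    quadratic = sum(dot(d, v) ** 2 / a for a, v in zip(eigenvalues, eigenvectors))
    return quadratic + sum(math.log(a) for a in eigenvalues)


def modified_bayes_discriminant(x, mean, eigenvalues, eigenvectors, tail_eigenvalue):
    """Assumed modified form: only the top k eigenpairs are used.

    The remaining n-k eigenvalues are all replaced by tail_eigenvalue.
    """
    d = [xi - mi for xi, mi in zip(x, mean)]
    n, k = len(x), len(eigenvalues)
    correction = sum(
        (1.0 - tail_eigenvalue / a) * dot(d, v) ** 2
        for a, v in zip(eigenvalues, eigenvectors)
    )
    log_det = sum(math.log(a) for a in eigenvalues) + (n - k) * math.log(tail_eigenvalue)
    return (dot(d, d) - correction) / tail_eigenvalue + log_det
```

When the eigenvalues beyond the k-th really are all equal to `tail_eigenvalue`, the two functions return the same value, so the modified form needs only k eigenpairs per category instead of n.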

However, the conventional pattern recognition
described above has the following problems.

Even if a modified Bayes discriminant function is
used, the recognition accuracy is poor for fonts that are greatly deformed
compared with the Mincho style, which is the most popular
font for Japanese, and for characters in a document that is greatly
degraded depending on the input/output conditions.
If characters of a greatly deformed font are also degraded, the
recognition accuracy decreases further.



Summary of the Invention 

It is an object of the present invention to provide
a pattern recognition apparatus for performing a
high-accuracy recognition process for a pattern with
a specific feature, such as greatly deformed fonts and
characters in a greatly degraded document.

The pattern recognition apparatus of the present
invention comprises a calculation device and a
recognition device, and performs recognition based on
a probability density function that is defined for each
category in the feature vector space of a pattern.

A set of difference vectors is generated by
calculating the difference between the feature vector of
each pattern in a specific pattern set and the average
feature vector of the correct category of that pattern. An expected
value of the probability density function of a specific
category is then defined, using as the weighting probability
density function a normal distribution whose covariance matrix
is the autocorrelation matrix of the set of difference
vectors and whose average is the feature vector of an unknown pattern.
A discriminant function of the specific category is defined based on
the expected value, and the calculation device calculates
the value of the discriminant function for the feature
vector of the unknown pattern.

The recognition device recognizes the unknown
pattern based on the value of the discriminant function
and outputs a recognition result.

Brief Description of the Drawings

Fig. 1 shows the basic configuration of a pattern
recognition apparatus of the present invention;

Fig. 2 shows the generation of a discriminant
function value;

Fig. 3 shows how the distribution of a
category changes;

Fig. 4 is a flowchart showing the first character
recognition process;

Fig. 5 is a flowchart showing the second character
recognition process;

Fig. 6 shows the configuration of an information
processing device; and

Fig. 7 shows storage media.

Description of the Preferred Embodiment 

The preferred embodiment of the present invention
is described in detail below with reference to the
drawings.

Fig. 1 shows the basic configuration of a pattern 
recognition apparatus of the present invention. The 
pattern recognition apparatus shown in Fig. 1 comprises 
a calculation device and a recognition device, and 
performs recognition based on a probability density 
function that is defined for each category in the feature 
vector space of a pattern. 

A set of difference vectors is generated by
calculating the difference between the feature vector of
each pattern in a specific pattern set and the average
feature vector of the correct category of that pattern. An expected
value of the probability density function of a specific
category is then defined, using as the weighting probability
density function a normal distribution whose covariance matrix
is the autocorrelation matrix of the set of difference
vectors and whose average is the feature vector of an unknown pattern.
A discriminant function of the specific category is defined based on
the expected value, and the calculation device 11 calculates
the value of the discriminant function for the feature
vector of the unknown pattern.

The recognition device 12 recognizes the unknown
pattern based on the value of the discriminant function
and outputs a recognition result.

A set of difference vectors is considered to express
an error distribution peculiar to a pattern set, and
its autocorrelation matrix includes information about
the error distribution. An expected value that reflects
the information of the error distribution is generated
by using, as the probability density function, a normal
distribution with this autocorrelation
matrix as the covariance matrix. Therefore, a discriminant
function that is defined based on this expected value
reflects a feature peculiar to the pattern set.

The calculation device 11 calculates the value of
this discriminant function for the feature vector of the
unknown pattern and outputs the calculation result to
the recognition device 12. The recognition device 12
judges the category of the unknown pattern, based on
the received discriminant function value, and outputs
the judgment result.

According to such a pattern recognition apparatus,
an unknown pattern with the same feature as the pattern
set can be recognized with higher accuracy. For example,
if a difference vector set is generated from a set of
character patterns of a greatly deformed font, the
recognition accuracy of an unknown pattern of the font
can be improved. If a difference vector set is generated
from a set of character patterns in a greatly degraded
document, the recognition accuracy of an unknown pattern
with a similar degradation can be improved.

The calculation device 11 and recognition device
12 shown in Fig. 1 correspond, for example, to the
combination of the CPU (central processing unit) 31 and
memory 32 shown in Fig. 6, which are described later.

This preferred embodiment adopts a method for
adding information peculiar to a font, or to deformation
due to degradation, to the conventional modified Bayes
discriminant function. First, when a set of specific input
patterns is inputted, the difference between the feature
vector of each pattern of the pattern set and the average
feature vector of the corresponding correct category is
calculated, and the set of obtained difference vectors
is regarded as an error distribution peculiar to the
pattern set.
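
As a minimal sketch of this first step, the following code builds the difference vectors of a labelled pattern set and their autocorrelation matrix. The function names and the toy data are hypothetical, and normalizing the autocorrelation matrix by the number of difference vectors is an assumption made here; the text does not state a normalization.

```python
def difference_vectors(patterns, category_means):
    """Subtract the average vector of the correct category from each pattern.

    patterns is a list of (feature_vector, correct_category) pairs.
    """
    return [
        [xi - mi for xi, mi in zip(x, category_means[c])]
        for x, c in patterns
    ]


def autocorrelation_matrix(diffs):
    """Average of the outer products d d^t over all difference vectors.

    The mean of the difference vectors is not subtracted, which is what
    distinguishes an autocorrelation matrix from a covariance estimate.
    """
    n, count = len(diffs[0]), len(diffs)
    return [
        [sum(d[i] * d[j] for d in diffs) / count for j in range(n)]
        for i in range(n)
    ]


# Hypothetical category averages and a small labelled pattern set.
category_means = {"A": [0.0, 0.0], "B": [10.0, 10.0]}
patterns = [
    ([1.0, 0.0], "A"),
    ([-1.0, 0.0], "A"),
    ([10.0, 12.0], "B"),
    ([10.0, 8.0], "B"),
]
diffs = difference_vectors(patterns, category_means)
error_matrix = autocorrelation_matrix(diffs)
```

Because the differences are pooled over all categories, the resulting matrix describes how this pattern set deviates from the category averages in general, rather than the spread of any single category.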

Then, when the feature vector x of an unknown pattern
is inputted, the error distribution peculiar
to the pattern set is placed at the point x in the feature vector
space shown in Fig. 2 and is designated as a probability density
function $F_e(x)$. The probability density function underlying
the discriminant function of a category C is designated
as $F_c(x)$, the expected value (similarity) of function
$F_c(x)$ is calculated using $F_e(x)$, and the discriminant
function value of category C is defined based on the
obtained expected value. In this case, the expected value
of function $F_c(x)$ corresponds to the area of the region
20 shown in Fig. 2 and is calculated according to the
following equation.

$$\text{Expected value of } F_c(x) = \int F_c(x) \cdot F_e(x) \, dx \tag{3}$$

Fig. 3 shows how the distribution of a category changes
in the case where such a discriminant function value
is used. A new distribution 23 is generated from both
the distribution 21 of feature vectors of category C
based on the original discriminant function and the error
distribution 22 peculiar to a specific pattern set.
Even if the original distribution
21 does not include the feature vector of the unknown
pattern, the new distribution 23 includes the feature
vector, and category C can be adopted as the recognition
result of the unknown pattern. Therefore, if the unknown
pattern has the same feature as the pattern set
corresponding to the error distribution 22, the
recognition accuracy of the unknown pattern can be
improved.

Next, how to calculate a new discriminant function
is described in detail. First, a function $q(m_1, m_2, \Sigma_1, \Sigma_2)$
that is obtained by integrating a normal distribution
with an average $m_1$ and a covariance matrix $\Sigma_1$, weighted
by a normal distribution with an average $m_2$ and a
covariance matrix $\Sigma_2$, is calculated according to the
following equation.

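$$\begin{aligned}
q(m_1, m_2, \Sigma_1, \Sigma_2) &= \int \frac{1}{(2\pi)^{n/2} |\Sigma_1|^{1/2}} \exp\left\{-\frac{1}{2}(x - m_1)^t \Sigma_1^{-1} (x - m_1)\right\} \cdot \frac{1}{(2\pi)^{n/2} |\Sigma_2|^{1/2}} \exp\left\{-\frac{1}{2}(x - m_2)^t \Sigma_2^{-1} (x - m_2)\right\} dx \\
&= \frac{1}{(2\pi)^{n/2} |\Sigma_1|^{1/2} |\Sigma_2|^{1/2} |\Sigma_1^{-1} + \Sigma_2^{-1}|^{1/2}} \exp\left(-\frac{1}{2} K\right)
\end{aligned} \tag{4}$$

In the above equation, K is given by

$$K = m_1^t \Sigma_1^{-1} m_1 + m_2^t \Sigma_2^{-1} m_2 - (\Sigma_1^{-1} m_1 + \Sigma_2^{-1} m_2)^t (\Sigma_1^{-1} + \Sigma_2^{-1})^{-1} (\Sigma_1^{-1} m_1 + \Sigma_2^{-1} m_2) \tag{5}$$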
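
The closed form can be checked numerically in the one-dimensional case, where the covariance matrices reduce to scalar variances. The sketch below is an illustration only: the function names, the integration range and the step count are assumptions chosen for the example. It compares a midpoint-rule integration of the product of two normal densities with the scalar version of equations (4) and (5).

```python
import math


def normal_pdf(x, mean, var):
    """Density of a one-dimensional normal distribution."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def q_numeric(m1, m2, var1, var2, lo=-30.0, hi=30.0, steps=20000):
    """Midpoint-rule approximation of the integral of the two densities."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * h
        total += normal_pdf(x, m1, var1) * normal_pdf(x, m2, var2)
    return total * h


def q_closed_form(m1, m2, var1, var2):
    """Scalar version of equations (4) and (5), with n = 1."""
    inv_sum = 1.0 / var1 + 1.0 / var2
    k = m1 * m1 / var1 + m2 * m2 / var2 - (m1 / var1 + m2 / var2) ** 2 / inv_sum
    return math.exp(-0.5 * k) / math.sqrt(2.0 * math.pi * var1 * var2 * inv_sum)
```

In this scalar case the closed form also coincides with a single normal density evaluated at $m_1$ with average $m_2$ and variance equal to the sum of the two variances, which is the one-dimensional counterpart of adding the two covariance matrices.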

A new discriminant function $\phi_c(x)$ can be defined
as follows using this $q(m_1, m_2, \Sigma_1, \Sigma_2)$.

$$\phi_c(x) = -2 \log q(m_c, x, \Sigma_c, \Sigma) \tag{6}$$

x: (n-dimensional) feature vector of an input pattern
$m_c$: Average vector of category C
$\Sigma_c$: Covariance matrix of category C
$\Sigma$: Covariance matrix of the error distribution of a
pattern set

$q(m_c, x, \Sigma_c, \Sigma)$ corresponds to the expected value
of equation (3). By removing constant terms from this
$\phi_c(x)$, the following discriminant function $\Phi_c(x)$ is
obtained.

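$$\Phi_c(x) = m_c^t \Sigma_c^{-1} m_c - (\Sigma^{-1} x + \Sigma_c^{-1} m_c)^t (\Sigma^{-1} + \Sigma_c^{-1})^{-1} (\Sigma^{-1} x + \Sigma_c^{-1} m_c) + \log|\Sigma_c| + \log|\Sigma^{-1} + \Sigma_c^{-1}|$$

Up to the terms $x^t \Sigma^{-1} x$ and $\log|\Sigma|$, which are common to
all categories and therefore do not affect which category is selected,
this is equivalent to

$$\Phi_c(x) = (x - m_c)^t (\Sigma + \Sigma_c)^{-1} (x - m_c) + \log|\Sigma + \Sigma_c| \tag{7}$$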

This $\Phi_c(x)$ equals the discriminant function obtained
by replacing $\Sigma_c$ with $\Sigma + \Sigma_c$ in the Bayes
discriminant function $f_c(x)$ of equation (1). In the same
way as the shift from the Bayes discriminant function to
the modified Bayes discriminant function, the following
new discriminant function $\psi_c(x)$ is defined.
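
$$\psi_c(x) = \frac{1}{\gamma_c^{l+1}} \left\{ \|x - m_c\|^2 - \sum_{i=1}^{l} \left(1 - \frac{\gamma_c^{l+1}}{\gamma_c^{i}}\right) \left((x - m_c) \cdot z_c^{i}\right)^2 \right\} + \log\left( \prod_{i=1}^{l} \gamma_c^{i} \cdot \prod_{i=l+1}^{n} \gamma_c^{l+1} \right) \tag{8}$$

$\gamma_c^{i}$: i-th eigenvalue of $\Sigma + \Sigma_c$
$z_c^{i}$: Eigenvector corresponding to the i-th eigenvalue
of $\Sigma + \Sigma_c$
l: Integer between 1 and n, including 1 and n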

In this case, both $\gamma_c^{i}$ and $z_c^{i}$ must be calculated.
However, calculating all eigenvalues and
eigenvectors for each category takes an enormous amount of time.
Therefore, of the respective eigenvalues and
eigenvectors of $\Sigma$ and $\Sigma_c$, only those in the higher ranks are
calculated, in descending order of eigenvalue, and the
approximate values of both the eigenvalues and
eigenvectors of $\Sigma + \Sigma_c$ are calculated at high speed using
those eigenvalues and eigenvectors.

In the following description, the suffix C of a
category is omitted for convenience, and $\gamma_c^{i}$ and
$z_c^{i}$ are written as $\gamma_i$ and $z_i$, respectively. Then, both
$\gamma_i$ and the n-dimensional vector $z_i$ (i = 1, 2, ..., l) can be
calculated as follows.

<Calculation algorithm of eigenvalues/eigenvectors> 

First, it is assumed that the higher-rank s
eigenvalues of the covariance matrix $\Sigma_c$ and the eigenvectors
corresponding to the eigenvalues are $\alpha_1 \ge \alpha_2 \ge \dots \ge \alpha_s$ ($s \le l$)
and $v_1, v_2, \dots, v_s$, respectively. It is also assumed that
the higher-rank t eigenvalues of the covariance matrix
$\Sigma$ and the eigenvectors corresponding to the eigenvalues are
$\beta_1 \ge \beta_2 \ge \dots \ge \beta_t$ ($t \le l$, $l \le s+t$) and $w_1, w_2, \dots, w_t$, respectively.
Then, as shown below, the vector subspace
spanned by $v_1, v_2, \dots, v_s$ and the vector subspace spanned
by $w_1, w_2, \dots, w_t$ are denoted by V and W, respectively, and the
space obtained by summing V and W is denoted by U.

$$V = \{v_1, v_2, \dots, v_s\} \tag{9}$$
$$W = \{w_1, w_2, \dots, w_t\} \tag{10}$$
$$U = V + W \tag{11}$$

Then, both the n-dimensional vectors $w'_1, w'_2, \dots, w'_u$
($s+u=l$) and $v'_1, v'_2, \dots, v'_v$ ($t+v=l$) that satisfy the
following equation are calculated by Gram-Schmidt
orthogonalization.

$$U = \{v_1, v_2, \dots, v_s, w'_1, w'_2, \dots, w'_u\} = \{v'_1, v'_2, \dots, v'_v, w_1, w_2, \dots, w_t\} \tag{12}$$

Then, matrices $V_W$, $W_V$, A and B are defined as follows.

$$V_W = (v_1 \; v_2 \; \dots \; v_s \; w'_1 \; w'_2 \; \dots \; w'_u) \tag{13}$$
$$W_V = (w_1 \; w_2 \; \dots \; w_t \; v'_1 \; v'_2 \; \dots \; v'_v) \tag{14}$$
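
$$A = \operatorname{diag}(\alpha_1, \alpha_2, \dots, \alpha_s, \alpha, \dots, \alpha) \tag{15}$$

$$B = \operatorname{diag}(\beta_1, \beta_2, \dots, \beta_t, \beta, \dots, \beta) \tag{16}$$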

Here, it is assumed that $\alpha$ ($= \alpha_{s+1}$) and $\beta$ ($= \beta_{t+1}$)
are already known. In this case, $W_V = V_W P$ holds for a specific orthogonal
matrix P. Therefore, the matrix $A + PBP^t$
is worked out in order to calculate the eigenvalues of
$\Sigma + \Sigma_c$ in U, this matrix is diagonalized as shown below,
and the obtained diagonal components are denoted by
$\gamma_1, \gamma_2, \dots, \gamma_{s+u}$ ($\gamma_{s+u} = \gamma_l$).

$$A + PBP^t = X \Gamma X^t \tag{17}$$
$$X = (x_1 \; x_2 \; \dots \; x_{s+u}) \tag{18}$$

$$\Gamma = \operatorname{diag}(\gamma_1, \gamma_2, \dots, \gamma_{s+u}) \tag{19}$$
Then, $z_1, z_2, \dots, z_{s+u}$ ($z_{s+u} = z_l$) are calculated
according to the following equation.

$$(z_1 \; z_2 \; \dots \; z_{s+u}) = V_W X \tag{20}$$

Next, a specific example of a pattern recognition
process using the discriminant function of equation (8)
is described with reference to Figs. 4 and 5.

For example, when characters in a document image that
is inputted by input equipment are recognized, the
following three character pattern sets are prepared,
assuming that the font type of the input document, the
type of the input equipment and the resolution of the
document image are F, I and R, respectively.

$S_F$: Set of the character patterns of font F

$S_I$: Set of character patterns inputted by input
equipment I

$S_R$: Set of character patterns inputted with
resolution R

Then, the difference between each feature vector
and the average feature vector of the category of each
correct character (correct category) is calculated for
each character pattern set, and the respective sets of
obtained difference vectors for the three character
pattern sets are denoted by $D_{SF}$, $D_{SI}$ and $D_{SR}$.

In this case, when a character of an unknown pattern
of font F is recognized, the expected value of the
probability density function of category C is calculated
using, as the probability density function, a normal
distribution with the feature vector x
of the unknown pattern as the average and the autocorrelation matrix
of difference vector set $D_{SF}$ as the covariance
matrix. Then, the discriminant function of equation
(8) is obtained based on the expected value, the value
of the discriminant function for the feature vector of
the unknown pattern is calculated, and character
recognition is performed.

When a character of an unknown pattern that is
inputted by input equipment I is recognized, the expected
value of the probability density function of category
C is calculated using, as the probability density function,
a normal distribution with the
feature vector x of the unknown pattern as the average and the
autocorrelation matrix of difference vector set $D_{SI}$ as
the covariance matrix.

When a character of an unknown pattern that is
inputted with resolution R is recognized, the expected
value of the probability density function of category
C is calculated using, as the probability density function,
a normal distribution with the
feature vector x of the unknown pattern as the average and the
autocorrelation matrix of difference vector set $D_{SR}$ as
the covariance matrix.

When the combination of these three difference
vector sets is used, first, at each point x in the feature
vector space, the expected value $f_1(x)$ of the probability
density function $f_0(x)$ of category C is calculated using,
as the probability density function,
a normal distribution with the autocorrelation matrix
of set $D_{SF}$ as the covariance matrix.

Then, at each point x, the expected value $f_2(x)$
of function $f_1(x)$ is calculated using, as the probability
density function, a normal
distribution with the autocorrelation matrix of set $D_{SI}$
as the covariance matrix. Furthermore, at each point x, the expected
value $f_3(x)$ of function $f_2(x)$ is calculated using, as the
probability density function, a normal
distribution with the autocorrelation matrix of set $D_{SR}$
as the covariance matrix, and this expected value $f_3(x)$ is designated as
the new probability density function of category C.

In this case, the corresponding discriminant
function $\psi_c(x)$ can be obtained by replacing the
$q(m_c, x, \Sigma_c, \Sigma)$ of equation (6) with the expected value $f_3(x)$.

Although three character pattern
sets $S_F$, $S_I$ and $S_R$ are used in this example, generally one or more arbitrary
character pattern sets can be used and their error
distributions can be reflected in the discriminant
function. In the calculation of an expected value, it
is not always necessary to use the error distribution
information in the order of sets $D_{SF}$, $D_{SI}$ and $D_{SR}$;
these can be used in an arbitrary order.
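
Why the order does not matter can be seen from a short derivation. It is a sketch under the assumption, consistent with the normal-distribution model used above, that $f_0$ is a normal density with average $m_c$ and covariance matrix $\Sigma_c$. The symbols $\Sigma_F$, $\Sigma_I$ and $\Sigma_R$ are introduced here only for illustration and stand for the autocorrelation matrices of the three difference vector sets; $N(y; m, \Sigma)$ denotes the normal density with average m and covariance matrix $\Sigma$ evaluated at y.

```latex
\begin{aligned}
f_1(x) &= \int f_0(y)\, N(y;\, x,\, \Sigma_F)\, dy = N(x;\, m_c,\, \Sigma_c + \Sigma_F) \\
f_2(x) &= \int f_1(y)\, N(y;\, x,\, \Sigma_I)\, dy = N(x;\, m_c,\, \Sigma_c + \Sigma_F + \Sigma_I) \\
f_3(x) &= \int f_2(y)\, N(y;\, x,\, \Sigma_R)\, dy = N(x;\, m_c,\, \Sigma_c + \Sigma_F + \Sigma_I + \Sigma_R)
\end{aligned}
```

Each step adds one covariance matrix, and matrix addition is commutative, so under this assumption any ordering of the three sets yields the same $f_3(x)$.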

Fig. 4 is a flowchart showing a character
recognition process using such three character pattern
sets. The pattern recognition apparatus first obtains
information about the equipment used to input the document
to be recognized (step S1). For the equipment, a scanner,
a digital camera, a facsimile device and the like are
used. The pattern recognition apparatus obtains this
information, for example, by automatically judging the
type of equipment through analysis of the image or by having a user
input the type of equipment.

Then, the pattern recognition apparatus obtains
information about the error distribution corresponding
to the input equipment from error distribution information
that is stored in advance, based on the obtained
information (step S2). Specifically, the higher-rank
$m_1$ eigenvalues/eigenvectors of the covariance matrix
corresponding to set $D_{SI}$ are extracted as error
distribution information.

Then, the pattern recognition apparatus obtains
information about the image resolution of the document
to be recognized (step S3). For the resolution, for
example, 157 dots/cm, 118 dots/cm, 79 dots/cm and the
like are used. The pattern recognition apparatus obtains
this information, for example, by automatically judging
the resolution through analysis of the image or by having a user
input the resolution.

Then, the pattern recognition apparatus obtains
information about the error distribution corresponding
to the resolution from error distribution information that
is stored in advance, based on the obtained information
(step S4). Specifically, the higher-rank $m_2$
eigenvalues/eigenvectors of the covariance matrix
corresponding to set $D_{SR}$ are extracted.

Then, the pattern recognition apparatus obtains
information about the font type of the document to be
recognized (step S5). For the font, for example, thin
Mincho, intermediate Mincho, bold Mincho, thin Gothic,
intermediate Gothic, bold Gothic, block style and the
like are used. The pattern recognition apparatus obtains
this information, for example, by automatically judging
the type of font through analysis of the image or by having a
user input the type of font.

Then, the pattern recognition apparatus obtains
information about the error distribution corresponding
to the font type from error distribution information that is stored
in advance, based on the obtained information (step S6).
Specifically, the higher-rank $m_3$
eigenvalues/eigenvectors of the covariance matrix
corresponding to set $D_{SF}$ are extracted.

Then, the pattern recognition apparatus
calculates the new eigenvalues/eigenvectors of each
category by applying the calculation algorithm for
eigenvalues/eigenvectors described above to both the
higher-rank $m_0$ eigenvalues/eigenvectors of the
covariance matrix of the fluctuating distribution of
each category and the eigenvalues/eigenvectors that are
obtained in steps S2, S4 and S6 (step S7).
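
One application of the calculation algorithm of equations (9) to (20), combining the eigenpairs of a category with those of a single error distribution, might look like the following sketch. It assumes NumPy is available and that the input eigenvectors are given as orthonormal columns; the function names and argument names are hypothetical, and with several error distributions the same routine would presumably be applied repeatedly, which is an assumption not spelled out in the text.

```python
import numpy as np


def complete_basis(primary, secondary, tol=1e-10):
    """Gram-Schmidt step: keep the columns of `primary`, then append the
    parts of the `secondary` columns orthogonal to everything kept so far."""
    basis = [primary[:, j].astype(float) for j in range(primary.shape[1])]
    for j in range(secondary.shape[1]):
        w = secondary[:, j].astype(float)
        for b in basis:
            w = w - (b @ w) * b
        norm = np.linalg.norm(w)
        if norm > tol:  # skip directions already contained in the span
            basis.append(w / norm)
    return np.column_stack(basis)


def combine_eigensystems(alphas, V, alpha_rest, betas, W, beta_rest):
    """Approximate the top eigenpairs of the sum of two covariance matrices
    from their top eigenpairs (alphas, V) and (betas, W)."""
    Vw = complete_basis(V, W)  # (v_1 .. v_s, w'_1 .. w'_u), equation (13)
    Wv = complete_basis(W, V)  # (w_1 .. w_t, v'_1 .. v'_v), equation (14)
    l = Vw.shape[1]            # dimension of U; Wv has the same column count
    A = np.diag(list(alphas) + [alpha_rest] * (l - len(alphas)))  # equation (15)
    B = np.diag(list(betas) + [beta_rest] * (l - len(betas)))     # equation (16)
    P = Vw.T @ Wv              # orthogonal matrix satisfying Wv = Vw P
    gammas, X = np.linalg.eigh(A + P @ B @ P.T)                   # equation (17)
    order = np.argsort(gammas)[::-1]  # eigh returns ascending eigenvalues
    return gammas[order], (Vw @ X)[:, order]                      # equation (20)
```

The point of the design is that the eigenproblem solved here has size l, the dimension of U, rather than n, the dimension of the feature vector space, which is where the speed-up comes from.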

Then, the pattern recognition apparatus
calculates the discriminant function value of equation
(8) using the obtained eigenvalues/eigenvectors and
performs the modified Bayes discrimination (step S8).
Then, the pattern recognition apparatus outputs the
recognition result obtained by the discrimination.

Although in the character recognition process
shown in Fig. 4 an error distribution is generated using
a character pattern set prepared in advance, the
character pattern set can also be dynamically generated
based on the inputted document to be recognized.

In this case, the pattern recognition apparatus
calculates the respective probability density function
values of all categories for each character pattern in
set $S_D$ of character patterns included in a specific input
document D, and extracts the character patterns for
which the maximum value of the probability density
function values is larger than a specific threshold value
as character patterns with a correct character. Then,
the pattern recognition apparatus calculates the
difference between each feature vector and the average
of each correct category for the extracted character
pattern set and designates the obtained difference vector
set as set $D_{SD}$.

Then, the pattern recognition apparatus
calculates the expected value of the probability density
function of category C for a specific character pattern
of character pattern set $S_D$ using, as the probability
density function, a normal distribution
with the feature vector x as the average and the autocorrelation matrix
of set $D_{SD}$ as the covariance matrix, and calculates the
discriminant function value of category C for x, based
on the obtained expected value.

Fig. 5 is a flowchart showing such a character
recognition process. The pattern recognition apparatus
first recognizes characters in the document to be
recognized by an arbitrary method (step S11). In this
character recognition, a city block distance or a Euclidean
distance can be used. Alternatively, a modified Bayes
discriminant function can be used.

Then, the pattern recognition apparatus estimates
the category of a correct character, based on the
recognition result (step S12). In this estimation, a
character for which the recognition distance value is
equal to or less than a specific threshold can be regarded
as a correct character. Alternatively, the recognition
confidence degree can be calculated, and a character
for which the recognition confidence degree is equal
to or more than a specific threshold can be regarded
as a correct character.
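
The distance-threshold variant of this estimation can be sketched as follows. This is an illustration only: the function names, the tuple layout of the first-pass results and the toy values are assumptions, and the threshold would have to be chosen for the recognizer actually used in step S11.

```python
def estimate_correct(results, threshold):
    """Keep the characters whose first-pass distance is within the threshold.

    results is a list of (feature_vector, recognized_category, distance)
    tuples; the recognized category of a kept character is treated as its
    correct category.
    """
    return [(x, c) for x, c, distance in results if distance <= threshold]


def error_vectors(accepted, category_means):
    """Difference between each accepted feature vector and the average
    vector of its estimated correct category."""
    return [
        [xi - mi for xi, mi in zip(x, category_means[c])]
        for x, c in accepted
    ]


# Hypothetical first-pass results for three characters of one document.
category_means = {"A": [1.0, 0.0], "B": [10.0, 10.0]}
results = [
    ([1.0, 0.5], "A", 0.4),
    ([5.0, 5.0], "A", 7.0),    # too far from its category: not trusted
    ([10.0, 9.0], "B", 1.0),
]
accepted = estimate_correct(results, threshold=2.0)
diffs = error_vectors(accepted, category_means)
```

Only the trusted characters contribute difference vectors, so a stricter threshold gives a cleaner but smaller sample of the document's error distribution.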

Then, the pattern recognition apparatus
calculates the covariance matrix of the error
distribution with respect to the estimated correct
characters (step S13), calculates the
eigenvalues/eigenvectors of the covariance matrix and
selects the higher-rank m eigenvalues/eigenvectors
(step S14).

Then, the pattern recognition apparatus
calculates the new eigenvalues/eigenvectors of each
category by applying the calculation algorithm for
eigenvalues/eigenvectors described above to both the
higher-rank $m_0$ eigenvalues/eigenvectors of the covariance
matrix with respect to the fluctuating distribution of
each category and the eigenvalues/eigenvectors that are
obtained in step S14 (step S15).

Then, the pattern recognition apparatus
calculates the discriminant function value of equation
(8) using the obtained eigenvalues/eigenvectors and
performs the modified Bayes discrimination (step S16).
Then, the pattern recognition apparatus outputs the
recognition result obtained by the discrimination. In
this case, the discriminant function values of all the
character patterns can be calculated again.
Alternatively, the discriminant function values of only
the characters that are estimated to be incorrect can be
calculated again. An incorrect character corresponds
to, for example, a character for which the distance value
is larger than the threshold value or a character for
which the confidence degree is smaller than the threshold
value.

The pattern recognition apparatus shown in Fig.
1 can be configured, for example, using the information
processing device (computer) shown in Fig. 6. The
information processing device shown in Fig. 6 comprises
a CPU (central processing unit) 31, a memory 32, an input
device 33, an output device 34, an external storage device
35, a medium driving device 36, a network connection
device 37 and input equipment 38, and the devices are
connected to one another by a bus 39.

The memory 32 is, for example, a ROM, a RAM and 
the like, and stores a program and data to be used for 
the process. The CPU 31 performs necessary processes 
by using the memory 32 and running the program. 

The input device 33 is, for example, a keyboard, 
a pointing device, a touch panel and the like, and is 
used for a user to input instructions and information. 
The output device 34 is, for example, a display device, 
a speaker, a printer and the like, and outputs inquiries 
and process results to a user. 

The external storage device 35 is, for example,
a magnetic disk device, an optical disk device, a
magneto-optical disk device, a tape device and the like.
The information processing device stores both the program
and data in this external storage device 35, and uses
the program and data by loading them into the memory 32
as required. For example, the external storage device
35 stores in advance, as data, both the eigenvalues/eigenvectors
of the covariance matrix with respect to the fluctuating
distribution of each category and the
eigenvalues/eigenvectors of the covariance matrix
corresponding to a difference vector set.

The medium driving device 36 drives a portable
storage medium 40 and accesses the recorded content.
For the portable storage medium, an arbitrary
computer-readable storage medium, such as a memory card,
a floppy disk, a CD-ROM (compact-disk read-only memory),
an optical disk, a magneto-optical disk and the like,
is used. A user stores the program and data in advance
in this portable storage medium and uses the program
and data by loading them into the memory 32 as required.

The network connection device 37 is connected to
an arbitrary communications network, such as a LAN (local
area network), and transmits/receives data accompanying
communications. The information processing device
receives the program and data from another device, such
as a server and the like, through the network connection
device 37 and uses the program and data by loading them
into the memory 32.

The input equipment 38 is, for example, a scanner,
a digital camera, a facsimile device and the like, and
inputs information, such as an image to be recognized.

Fig. 7 shows computer-readable storage media for
providing the information processing device shown in
Fig. 6 with a program and data. The program and data
stored in the portable storage medium 40 or the database
42 of a server 41 are loaded into the memory 32. In this
case, the server 41 generates a propagation signal for
propagating the program and data, and transmits the
signal to the information processing device through an
arbitrary transmitting medium in the network. Then, the
CPU 31 performs necessary processes by running the
program using the data.

Although in the preferred embodiment described 
above, mainly character recognition is described, the 
present invention is applicable to arbitrary pattern 
recognition in which a probability density function can 
be defined for each category in a feature vector space. 
For example, the present invention is also applicable 
to a process for recognizing a pattern, such as a ruled 
line included in a document image, and the shape, color 
and the like, of an object included in the image of a 
graphic/photograph. Furthermore, information as a 
recognition target is not limited only to images, but 
any information sequentially inputted in a time series, 
such as voice, can also be used. 

According to the present invention, high-accuracy
pattern recognition can be implemented for a recognition
target for which only extremely low recognition accuracy
can be achieved by the conventional method.
For example, in the case of character recognition, the
recognition accuracy can be improved for a font that is greatly deformed compared
with the most popular Mincho style, for a character in a
document greatly degraded by an input/output condition,
and for a degraded character of a greatly deformed font.



